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As the development of distributed systems progresses, more and more challenges 
', arise and the need for developing optimized systems and for optimizing existing 

' systems from multiple perspectives becomes more stringent. In this paper I 

, present novel algorithmic techniques for solving several optimization problems 

regarding distributed systems with tree topologies. I address topics like: relia- 
^ , bility improvement, partitioning, coloring, content delivery, optimal matchings, 

^ O ' as well as some tree counting aspects. Some of the presented techniques are 

, only of theoretical interest, while others can be used in practical settings. 
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1 Introduction 

Distributed systems are being increasingly developed and deployed all around 
C/2 , the world, because they present efficient solutions to many practical problems. 

^ ' However, as their development progresses, many problems related to scalability, 

fault tolerance, stability, efficient resource usage and many other topics need to 
be solved. Developing efficient distributed systems is not an easy task, because 
many system parameters need to be fine tuned and optimized. Because of this, 
optimization techniques are required for designing efficient distributed systems 
, or improving the performance of existing, already deployed ones. In this paper 

I present several novel algorithmic techniques for some optimization problems 
regarding distributed systems with a tree topology. 

Trees are some of the simplest non-trivial topologies which appear in real- 
OO , life situations. Many of the existing networks have a hierarchical structure (a 

' tree or tree- like graph) , with user devices at the edge of the network and router 

backbones at its core. Some peer-to-peer systems used for content retrieval 
and indexing have a tree structure. Multicast content is usually delivered using 
multicast trees. Furthermore, many graph topologies can be reduced to tree 
' topologies, by choosing a spanning tree or by covering the graph's edges with 

edge disjoint spanning trees [1]. In a tree, there exists a unique path between 
any two nodes. Thus, the network is quite fragile. The fragility is compensated 
by the simplicity of the topology, which makes many decisions become easier. 

This paper is structured as follows. Section 2 defines the main notations 
which are used in the rest of the paper. In Section 3 I consider the minimum 
weight cycle completion problem in trees. In Section 4 I discuss two tree parti- 
tioning problems and in Section 5 I consider two content delivery optimization 
problems. In Section 6 I solve several optimal matching problems in trees and 
powers of trees and in Section 7 I analyze the first fit online coloring heuristic, 
applied to trees. In Section 8 I consider three other optimization and tree count- 
ing problems. In Section 9 I discuss related work and in Section 10 I conclude 
and present future work. 
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2 Notations 



A tree is an undirected, connected, acyclic graph. A tree may be rooted, in 
which case a special vertex r will be called its root. Even if the tree is unrooted, 
we may choose to root it at some vertex. In a rooted tree, we define parent{i) 
as the parent of vertex i and ns{i) as the number of sons of vertex i. For a 
leaf vertex i, ns{i) = and for the root r, parent{r) is undefined. The sons of 
a vertex i are denoted by s{i,j) (1 < j < ns{i)). A vertex j is a descendant 
of vertex i if {parent{j) = i) or parent{j) is also a descendant of vertex i. We 
denote by T(i) the subtree rooted at vertex i, i.e. the part of the tree composed 
of vertex i and all of its descendants (together with the edges connecting them). 
In the paper, the terms node and vertex will be used with the same meaning. 

A matching M of a graph G is a set of edges of the graph, such that any 
two edges in the set have distinct endpoints (vertices). A maximum matching 
is a matching with maximum cardinality (maximum number of edges). 

3 Minimum Weight Cycle Completion of a Tree 

We consider a tree network with n vertices. For m pairs of vertices which 
are not adjacent in the tree, we are given a weight w{i,j) (we can consider 
w{i,j) = +00 for the other pairs of vertices). We want to connect some of these 
m pairs (i.e. add extra edges to the tree), such that, in the end, every vertex of 
the tree belongs to exactly one cycle. The objective consists of minimizing the 
total weight of the edges added to the tree. For the unweighted case {w{i,j) = 1) 
and when we can connect any pair of vertices which is not connected by a 
tree edge, there exists the following simple greedy algorithm [3]. We select an 
arbitrary root vertex r and then traverse the tree bottom-up (from the leaves 
towards the root). For each vertex i we will compute a value representing 
the largest number of vertices on a path P{i) starting at i and continuing in 
T(i), such that every vertex j e (T(i) \ P{i)) belongs to exactly one cycle 
and the vertices in P(i) are the only ones who do not belong to a cycle. We 
denote by e(z) the second endpoint of the path (the first one being vertex i). 
For a leaf vertex i, we have Z(i) = 1 and e(i) = i. For a non-leaf vertex i, 
we first remove from its list of sons the sons s(«, j) with l{s{i,j)) = 0, update 
ns{i) and renumber the other sons starting from 1. If i remains with only 
one son, we set = l{s{i,\)) + 1 and e{i) = e(s(i, 1)). If i remains with 
ns{i) > 1 sons, we will sort them according to the values l{s{i,j)), such that 
l{s{i,l)) < l{s{i,2)) < ... < l{s{i,ns{i))). We will connect by an edge the 
vertices e{s{i, 1)) and e(s(i,2)). This way, every vertex on the paths P{s{i, 1)) 
and P{s{i, 2)), plus the vertex i, belong to exactly one cycle. For the other sons 
s{i,j) (3 < j < ns(i)), we will have to connect s(i, j) to e(s(z, ))). This will only 
be possible if l{s{i,j)) > 3; otherwise, the tree admits no solution. Afterwards, 
we set = 0. If the root r has only one son, then we must have l{r) > 3, such 
that we can connect r to e(r). 

For the general case, I will describe a dynamic programming algorithm (as 
the greedy algorithm cannot be extended to this case). We will again root the 
tree at an arbitrary vertex r, thus defining parent-son relationships. For each 
vertex i, we will compute two values: wA{i)=the minimum total weight of a 
subset of edges added to the tree such that every vertex in T{i) belongs to 
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exactly one cycle, and 'wB{i)=the minimum total weight of a subset of edges 

added to the tree such that every vertex in {T{i) \ {i}) belongs to exactly one 
cycle (and vertex i belongs to no cycle). We will compute the values from 
the leaves towards the root. For a leaf vertex i, we have wA{i) = +00 and 
wB{i) — 0. For a non-leaf vertex i, we have: wB(i) = X^J^i*^ ^^(•^l*; j))- 
order to compute wA(i) we will first traverse T(«) and for each vertex j, we will 
compute wAsum{i,ji)=\hQ sum of all the wA{j)) values, where p is a son of a 
vertex q which is located on the path from i to j {P{i ■ ■ ■ j)) and p does not 
belong to P{i...j). We have wAsum{i,i) = wB(i) and for the other vertices 
j we have wAsum{i,j) = wAsum{i,parent{j)) — wA{j) + wB{j). Now we will 
try to add an edge, such that it closes a cycle in the tree which contains vertex 
i. We will first try to add edges of the form where j is a descendant of i 

(but not a son of i, of course) - these will be called type 1 edges. Adding such an 
edge (i,j) provides a candidate value wcand(i,i, j) for wA{i): wcand{i,i, j) = 
'wAsum{i, j) + 'w{i,j). We will then consider edges of the form {p,q) (p ^ i 
and q ^ i), where the lowest common ancestor of p and q {LCA{p, q)) is vertex 
i - these will be called type 2 edges (we consider every pair of distinct sons 
s{i,a) and s{i,b), and for each such pair we consider every pair of vertices 
p € T{s{i, a)) and q G T{s{i, b)) and verify if the edge (p, q) can be added to the 
tree). Adding such an edge {p,q) provides a candidate value wcand(i,p, q) for 
wA{i): wcand{i,p,q)='wAsum{i,p)+wAsum{i,q)-wB[i)+w{p,q). wA{i) will 
be equal to the minimum of the candidate values wcand{i, *, *) (or to +00 if no 
candidate value exists). We can implement the algorithm in O(n^) time, which 
is optimal in a sense, because w < (n • (n— l)/2 — n+ 1), which is O(n^). wA(r) 
is the answer to our problem and we can find the actual edges to add to the 
tree by tracing back the way the wA{*) and wB{*) values were computed. 

However, when the number m of edges which can be added to the tree 
is significantly smaller, we can improve the time complexity to 0{{n + m) ■ 
log{n)). We will compute for each of the m edges (i,j) the lowest common 
ancestor of the vertices i and j {LCA(i,j)) in the rooted tree. This can 
be achieved by preprocessing the tree in 0{n) time and then answering each 
LCA query in 0(1) time [2]. If LCA{i, j) = fc, then we will add the edge 
(i,j) to a list Ledge{k). Then, for each non-leaf vertex i, we will traverse 
the edges in Ledge{k). For each edge {p,q) we can easily determine if it is 
of type 1 (i = p or I = g) or of type 2 and use the corresponding equa- 
tion. However, we need the values wAsum{i,p) and wAsum{i,q). Instead 
of recomputing these values from scratch, we will update them incrementally. 
It is obvious that tDAsum{parent{i),p)=wAsu'm{i,p)+wB{parent{i))-tuA{i). 
We will preprocess the tree, by assigning to each vertex i its DFS number 
DFSnum{i) {DFSnum{i)=j if vertex i was the j*'* distinct vertex visited dur- 
ing a DFS traversal of the tree which started at the root). Then, for each 
vertex i, we compute D FSmaxii) =the maximum DFS number of a vertex 
in its subtree. For a leaf node i, we have DFSmax{i) = DFSnum{i). For 
a non-leaf vertex i, DFSmax{i)=m,ax{DFSnum,{i). DFSmax{s{i,l)), 
DFSmax{s{i,ns{i)))}. We will maintain a segment tree, using the algorithmic 
framework from [15]. The operations we will use are range addition update and 
point query. Initially, each leaf « (1 < J < n) has a value v{i) = 0. Before com- 
puting wA{i) for a vertex i, we set the value of leaf DFSnum(i) in the segment 
tree to wB{i). Then, for each son s{i,j), we add the value {wB{i) — wA{s{i,j)) 
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to the interval [DFSnum{s{i,j)), DFSmax{s{i,j))] (range update). We can 
obtain wAsu'm{i,p) for any vertex p £ T{i) by querying the vahic of the cell 
DFSnum{p) in the segment tree: we start from the (current) value of the leaf 
DFSnum{p) and add the update aggregates uagg stored at every anestor node 
of the leaf in the segment tree. Queries and updates take 0{log{n)) time each. 

If the objective is to minimize the largest weight Wmax of an edge added to 
the tree, we can binary search Wmax and perform the following feasibility test 
on the values Wcand chosen by the binary search: we consider only the "extra" 
edges with w{i,j) < Wcand and run the algorithm described above for 

these edges; if wA{r) ^ +00, then Wcand is feasible. 

4 Tree Partitioning Techniques 

4.1 Tree Peirtitioning with Lower and Upper Size Bounds 

Given a tree with n vertices, we want to partition the tree into several parts, 
such that the number of vertices in each part is at least Q and at most k ■ Q 
{k > 1). Each part P must have a representative vertex u, which does not 
necessarily belong to P. However, (P U {u}) must form a connected subtree. I 
will present an algorithm which works for A; > 3. We root the tree at any vertex 
r, traverse the tree bottom- up and compute the parts in a greedy manner. For 
each vertex i we compute w{i)=the size of a connected component C(i) in T{i), 
such that vortex i G C{i) , |C(i)| < Q, and all the vertices in (T(z) \ C(i)) were 
split into parts satisfying the specified properties. For a leaf vortex i, w{i) = 1 
and C{i) = {i}. For a non-leaf vertex i, we traverse its sons (in any order) and 
maintain a counter ws(i)=tho sum of the w{s(i, j)) values of the sons traversed 
so far. liws{i) exceeds Q — 1 after considering the son s{i,j), we form a new part 
from the connected components C(s{i, lastson + 1)), . . . , C{s{i, j)) and assign 
vertex i as its representative. Then, wc reset w.s{i) to 0. {last_son < j) is the 
previous son where ws{i) was reset to (or 0, if ws{i) was never reset to 0). 

After considering every son of vertex i, we set w(i) = ws{i) + 1 and the 
component C{i) is formed from the components C{s{i, j)) which were not used 
for forming a new part, plus vertex i. U ws{i) + 1 = Q, then we form a new 
part from the component C{i) and set wli) = and C{i) = {}. During the 
algorithm, the maximum size of any part formed is 2 • Q — 2. At the end of the 
algorithm, we may have that w{r) > 0. In this case, the vertices in C(r) were 
not assigned to any part. However, at least one vertex from C(r) is adjacent to 
a vertex assigned to some part P. Then, we can extend that part P in order 
to contain the vertices in C(r). This way, the maximum size of a part becomes 
3 ■ Q — 3. The pseudocode of the first part of the algorithm is presented below. 
In order to compute the parts, we maintain for each vertex i a value part{i), 
which is 0, initially (0 means that the vertex was not assigned to any part). 
In order to assign distinct part numbers, we will maintain a global counter 
partjnumber, whose initial value is 0. The first part of the algorithm has linear 
time complexity (0(n)). The second part (adding C(r) to an already existing 
part) can also be performed in linear time, by searching for an edge {p, q), such 
that part{p) = and part{q) > (there are only n — 1 = 0{n) edges in a tree). 

LowerUpperBoundTreePartitioning(Q, i): 

if (ns(i)=0) then w(i)=l else 
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ws(i)=lasLson=0 

for j=l to ns(i) do // j=l,2,. . . ,ns(i) 

LowerUpperBoundTreePartitioning (Q, s(i,j)) 

■ws(i)=ws(i)+w(s(i,j)) 

if {ws{i) > Q) then 
part-number=part-number + 1; last-Son=j; ws(i)=0 
for k=last-Son+l to j do AssignPartNumber('5('i,A;^, part-number) 
w(i)=ws(i)+l 
if {w{i) > Q) then 

parLnuTnber=part-nuTnber + 1; w(i)=0 

AssignPartNumber('i, part-number) 

AssignPartNumber(i, part_number): 

if {part{i) ^ 0) then return() 
part{%)=part-number 

for j=l to ns{i) do AssignPartNumberf^sf^zj^, part-number) 
4.2 Connected Tree Pcirtitioning 

I will now present an efficient algorithm for identifying k connected parts of given 
sizes in a tree (if possible), subject to minimizing the total cost. Thus, given a 
tree with n vertices, we want to find k vertex-disjoint components (called parts), 
such that the i^^ part (1 < ? < fc) has sz{i) vertices (.sz(l)-|-sz(2)-|-. . .-|-sz(fc) < n 
and sz{i) < sz{i + 1) for 1 < i < A; — 1). Each tree edge has a cost ce{i,j) 
and each tree vertex i has a cost cv{i). We want to minimize the sum of the 
costs of the vertices and edges which do not belong to any part. An edge (z, j) 
belongs to a part p if both vertices i and j belong to part p. 

In order to obtain k connected components of the given sizes we need to keep 
Q — k edges of the tree and remove the others, where Q = sz{l) + . . . + sz{n). 
We could try all the ({n — 1) choose {Q — k)) possibilities of choosing Q — k edges 
out of the n—1 edges of the tree. For each possibility, we obtain k' = n — Q + k 
connected components with sizes sz'{l) < sz'{2) < ... < sz'{k'); in case of 
several components with equal sizes, we sort them in increasing order of the 
total cost of the vertices in them. Then, we must have sz{j) = sz'{k' — k + j) 
and the total cost of the possibility is the sum of the costs of the removed edges 
plus the sum of the costs of the vertices in the components 1,2, . . . ,k' — k (which 
should have only one vertex each, if the size conditions hold). However, this 
approach is quite inefficient in most cases. I will present an algorithm with time 
complexity 0{n^ ■ 3'^). We root the tree at an arbitrary vertex r. Then, we 
compute a table Cmin{i,j,S)=the minimum cost of obtaining from T{i) the 
parts with indices in the set S and, besides them, we are left with a connected 
component consisting of j vertices which includes vertex i and, possibly, several 
vertices which are ignored (if j = 0, then every vertex in T{i) is assigned to one 
of the parts in S or is ignored). We compute this table bottom- up: 

ConnectedTreePartitioning(i) : 

for each S' C {1, 2, . . . , A:} do for j^O to n do Cmin(i,j,S) =+oo 
Cmin(i, 1, {))=0; Cmin(i, 0, {})=cv(i) 
for x=l to ns(i) do 

ConnectedTreePartitioning (s(i,x)) 

for each 5 C {1, 2, . . . , fc} do for j=0 to n do 
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Caux ( S)= Cmin ( S); Cmin (i,j,S) =+ oo 
for each 5* C {1, 2, . . . , fc} do for i=0 to n do 
for each W C 5 do for q=0 to qlimit(j) do 
Cmin (i,j,S)=min{ Cmin (i,j,S), Caux(i,j-q,S \W) + extra-Cost(i,s(i,x),q) 
+ Cmin(s(i,x),q,W)} 
for each S C {1, 2, . . . ,k} do 
for j=0 to n do if (Cmin{i,j, S) < +oo) then 
for q=l to k do if ((j=sz(q)) and (q ^ S)) then 
Cmin(i,0,S U {q})—min{Cm,in(i,j,S), Cmin(i,0,S U {q})} 

We define extra-Cost(i, son-xJ, q)=if {q > 0) then return(O) else return(ce(i, 
sou-xJ)) and qlimit(j)=max{j-l,0}. The algorithm computes Cmin(i,*,*) from 
the vahics of vertex i's sons, using the principles of tree knapsack. The total 
amount of computations for each vertex is 0{ns{i)-3'' -n^). Summing over all the 
vertices, we obtain 0(n^-3'^). The minimum total cost is Cmin{r, 0, {1, 2, ... , k}) 
(if this value is +oo, then we cannot obtain k parts with the given sizes). In 
order to find the actual parts, we need to trace back the way the Cmin{*, *, *) 
values were computed, which is a standard procedure. When the sum of the 
sizes of the k parts is n, then every vertex belongs to one part. 

5 Content Delivery Optimization Problems 
5.1 Minimum Number of Unicast Streams 

We consider a directed acyclic graph G with n vertices and m edges. Every 
directed edge {u,v) has a lower bound lbeG{u,v), an upper bound ubeG{u,v) 
and a cost ceciu^v). Every vertex u has a lower bound Ibvciu), an upper 
bound ubvoiu) and a cost cvq{v). We need to determine the minimum number 
of unicast communication streams p and a path for each of the p streams, such 
that the number of stream paths npe{u,v) containing an edge {u,v) satisfies 
lbeQ{u,v) < npe{u,v) < ubeQ{u,v) and the number of paths npv{u) containing 
a vertex u satisfies Ibvciu) < npv{u) < ubvciu). Each vertex u can be a source 
node, a destination node, both or none. A stream's path may start at any 
source node and finish at any destination node. Moreover, for the number of 
streams p, we want to compute the paths such that the sum S over all the values 
{npe{u,v) — lbeG{u,v))-ceG{u,v) and {npv{u) — lbvG{u))-cvG{u) is minimum. 

Particular cases of this problem have been studied previously. When IbvQiu) 
=1 and ubvciu) = 1 for every vertex u, lbeG{u,v) = and ubeG{u,v) = +oo 
for every directed edge {u,v), all the costs are 0, and every vertex is a source 
and destination node, we obtain the minimum path cover problem in directed 
acyclic graphs, which is solved as follows [18]. Construct a bipartite graph B 
with n vertices x\,. . . ,Xn on the left side and n vertices y\,. . . ,yn on the right 
side. We add an edge {xi,yj) in B if the directed edge (z, j) appears in G. Then, 
we compute a maximum matching in B. If the cardinality of this matching is C, 
then we need p = n — C streams. The paths are computed as follows. Having an 
edge {xi,yj) in the maximum matching means that the edge {i,j) in G belongs 
to some stream's path. If two edges (xi,yj) and {xj,yk) in B belong to the 
matching, then the edges {i,j) and {j,k) in G belong to the path of the same 
stream. For non-zero costs, wc compute a minimum (total) weight matching in 
B (where every edge {xi,yj) has a weight equal to ce{i,j)). 
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In order to solve the problem I mentioned, we will use a standard trans- 
formation and construct a new graph G' where every vertex u is represented 
by two vertices Uin and Uout- For every directed edge {u,v) in G, we add an 
edge {uoutiVin) in G' , with the same cost and lower and upper bounds. We 
also add a directed edge from Uin to Uout in G' (for every vertex u in G), with 
cost cvg{u)i lower bound Ibvciu) and upper bound ubvciu). Then we add two 
special vertices s (source) and t (sink) to G'. For every source node u in G, we 
add a directed edge (s, ?ij„) in G', with lower bound and cost and upper bound 
+CXD. For every destination node v in G, we add a directed edge {vout,t), with 
lower bound and cost and upper bound +00. We also add the edges (s, t) and 
(t, s) with lower bound and cost and upper bound +00. The resulting graph 
G' has costs, lower and upper bounds only on its edges and not on its vertices. 
In order to compute the minimum number of communication streams which 
satisfy the constraints imposed by G, it is enough to compute a (minimum cost) 
minimum feasible flow in G', from s to t. Decomposing the flow into unit-flow 
paths (in order to obtain the path of each communication stream) can then be 
done easily. We repeatedly perform a graph traversal (DFS or BFS) from s to 
t in G', considering only directed edges with positive flow on them. From the 
traversal tree, by following the "parent" pointers, we can find a path P from s 
to t, containing only edges with positive flow. We compute the minimum flow 
fP on any edge of P, transform P into fP unit paths and then decrease the 
flow on the edges in P by fP. If we remove the first and last vertices on any 
unit path (i.e. s and t), we obtain a path from a vertex to a vertex Vgut, 
where u is a source node in G and w is a destination node in G. We will use 
the algorithm presented in [18] for determining a feasible flow (not necessarily 
minimum) in a flow network with lower and upper bounds on its edges. We will 
denote this algorithm by A(F, s, t) (F is the flow network given as argument, s 
is the source vertex and t is the sink vertex). I will describe A{F, s, t) briefly. 
We construct a new graph F' from F, as follows. We maintain all the vertices 
and edges in F . For every directed edge (u, v) in F, the directed edge (u, v) in 
F' has the same cost, lower bound and upper bound (u6eir(u,f) — lbeF(u,v)). 
We add two extra vertices s' and f/ and the following zero-cost directed edges: 
(s', u) and (u, t') for every vertex u in F (including s and t). The lower bound of 
every edge will be 0. The upper bound of a directed edge (s', u) in F' is equal to 
the sum of the lower boimds of the directed edges (*, u) in F. The upper bound 
of every directed edge (u, t') in F' is equal to the sum of the lower bounds of the 
directed edges (u, *) in F. The algorithm A{F, s, i) computes a minimum cost 
maximum flow g in the graph F' (which, as stated, only has upper bounds); if 
all the costs are 0, only a maximum flow is computed. If g is equal to the sum of 
the upper bounds of the edges (s', *) (or, equivalently, of the edges (*, t')). then 
a feasible flow from s to i exists in F: the flow on every directed edge {u, v) in 
F will be Ibepiu, v) plus the flow on the edge (u, v) in F' . 

We will first run the algorithm on G' (i.e. call A{G' , .s, t)) in order to verify 
if a feasible flow exists). If no feasible flow exists, then the constraints cannot 
be satisfied by any number of streams. Otherwise, we construct a graph G" 
from G', by adding a new vertex snew and a zero-cost directed edge (snew, s) 
with lower bound and upper bound x. snew will be the new source vertex 
and a; is a parameter which is used in order to limit the amount of flow entering 
the old source vertex s. We will now perform a binary search on x, between 
and gmax, where gmax is the value of the feasible flow computed by calling 
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A{G', s, t). The feasibility test consists of verifying if there exists a feasible flow 

in the graph G" (i.e. calling A{G" , sneAii, t)). The minimum value of x for which 
a feasible flow exists in G" is the value of the minimum feasible flow in G", from 
s to t. Obtaining the feasible flow in G' from the feasible flow in G" is trivial: 
for every directed edge {u,v) in G' , wc set its amount of flow to the flow of 
the same edge {u,v) in G". The time complexity of the presented algorithm is 
0{MF{n,m) ■ log{gmax)), where gmax is a good upper bound on the value of 
a feasible flow and MF{n, m) is the best time complexity of a (minimum cost) 
maximum flow algorithm in a directed graph with n vertices and m edges. 

5.2 Degree-Constrained Minimum Spanning Tree 

In [13], the following problem was considered: given an undirected graph with 
n verices and m edges, where each edge (i, j) has a weight w(i,j) > 0, compute 
a spanning tree MST of minimum total weight, such that a special vertex r has 
degree exactly k in MST. A solution was proposed, based on using a parameter 
d and setting the cost of each edge (r, j) adjacent to r, c{r,j) = d + w{r,j); 
the cost of the other edges is equal to their weight. Parameter d can range 
from —DO to +00. We denote by MST{d)=the minimum spanning tree using 
the cost functions defined previously. When d = — oo, MST{d) contains the 
maximum number of edges adjacent to r. For d = +oo, MST{d) contains 
the minimum number of edges adjacent to r. We define the function ne(d.)=\\ic 
number of edges adjacent to r in MST{d). ne{d) is non-increasing on the interval 
[—00, +oo]. We will binary search the smallest value dopt of the parameter d in 
the interval [— oo, +oo], such that ne{dopt) < k. We will finish the binary search 
when the length of the search interval is smaller than a small constant £ > 0. 

If ne(dopt) = k, then the edges in MST (dopt) form the required minimum 
spanning tree. If ne(dopt) < fc, then ne{dopt — e) > k. We define S{d)=the 
set of edges adjacent to vertex r in MST(d). It is easy to prove that S{dopt) 
is included in S{dopt — s). The required minimum spanning tree is constructed 
in the following manner. The edges adjacent to vertex r will be the edges in 
S{dopt), to which we add {k — ne{dopt)) arbitrary edges from the set S{dopt — 
e) \ S{dopt). Once these edges are fixed, we construct the following graph G: 
we set the cost of the chosen edges to and the cost of the other edges (i, j) 
to w{i,j). We now compute a minimum spanning tree MSTq in G. The edges 
in MSTg are the edges of the minimum spanning tree of the original graph, in 
which vertex r has degree exactly k. The time complexity of this approach is 
0{m ■ logim) ■ log{DMAX)), where DMAX denotes the range over which we 
search the parameter d. When m is not too large (i.e. m is not of the order 
0(n^)), this represents an improvement over the O(n^) solution given in [13]. 

6 Matching Problems 

6.1 Mctximum Weight Matching in an Extended Tree 

Let's consider a rooted tree (with vertex r as the root). Each vertex i has a 

weight w{i). We want to find a matching in the following graph G (extended 
tree), having the same vertices as T and an edge {x,y) between two vertices x 
and y, if: (i) x and y are adjacent in the tree; (it) x and y have the same parent 
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in the tree. The weight of an edge {x,y) in G is \w{x) — w{y)\. The weight 

of a matching is the sum of the weights of its edges. Wc arc interested in a 
maximum weight matching in the graph G. For each vertex i, we sort its sons 
s{i, 1), . . . , s{i, ns{i)) in non-decreasing order of their weights, i.e. w{s{i, 1)) < 
... < w(i,ns{i)). We will compute for each vertex i two values: A{i)=the 
maximum weight of a matching in T{i) if vertex i is the endpoint of an edge 
in the matching and B{i)=ih.e maximum weight of a matching in T{i) if vertex 
i is not the endpoint of any edge in the matching. In order to compute these 
values, we will compute the following tables for every vertex i: GA{i,j, /c)=the 
maximum weight of a matching in T{i) if vertex i is the endpoint of an edge in 
the matching and we only consider its sons s{i, j), s{i,j+l), . . . , s{i, k) (and their 
subtrees). Similarly, we have CB{i,j, k), where vertex i does not belong to any 
edge in the matching. The maximum weight of a matching is max{A{r), B{r)}. 
The actual matching can be computed easily, by tracing back the way the A(i), 
B{i), CA{i,*,*) and CB{i,*,*) values were computed. A recursive algorithm 
(called with r as its argument) is given below. The time complexity is 0{ns{i)'^) 
for a vertex i and, thus, O(n^) overall. 

MaximumWeightMatching-ExtendedTree(i): 

if (ns(i)=0) then A(i)=B(i)=0 else 
for j=l to ns(i) do MaximumWeightMatching-ExtendedTree(s('yyj 

for j=l to ns(i) do 
CA(i, J, J - 1)= -w; CA{i,],j) = \w{i) - w{s{i,j))\+B{s{i,3)) 
CB(i, j, j - 1)= 0; CB(i, J, :j)=max{A(s(i,j)), B(s(i,j))] 

for count=l to (ns(i)-l) do for j=l to (ns(i)-count) do 

k = j + count 

CA(i,j,k)=max{\2u{s{i, j)) ~w{s{i,k))\ + B(s(i,j)) + B(s(i,k)) + CA(i, j 
+ 1, k - 1), \w{i) -w{s{i,i))\ + B(s(i,i)) + CB(i, k), \w{i) -w{s{i,k))\ 

+ B(s(i,k)) + CB(i, j, k-1), max{A(s(i,j)), B(s(i.j))} + CA(i. k), max{ 

A(s(i,k)), B(s(i,k))] + CA(i, J, k-1)] 

CB(i,j,k)=max{\w{s{i, j)) - w{s{i, k))\ + B(s(i,j)) + B(s(i,k)) + CB(i, j + 
1, k - 1), max{A(s(i,j)), B(s(i,j))} + CB(i, j+l, kj, max{A(s(i,k)), B(s(i,k))} 
+ CB(i, ], k-1)} 

A (i)=CA (i, l,ns(i)); B(i)=CB(i, l,ns(i)) 

6.2 McLximum Matching in the Power of a Graph 

The A;*'* power G*^ {k > 2) of a graph G is a graph with the same set of vertices 
as G, where there exists an edge {x,y) between two vertices x and y if the 
distance between x and ?/ in G is at most k. The distance between two vertices 
{x, y) in a graph is the minimum number of edges which need to be traversed 
in order to reach vertex y, starting from vertex x. A maximiim matching in G^ 
of a graph G can be found by restricting our attention to a spanning tree T of 
G. The following linear algorithm (called with i = r), using observations from 
[12], solves the problem (we consider that, initially, no vertex is matched): 

MaximumMatchingGk(i) : 
if (ns(i)=0) then return() else 

lasLson=0 

for j=l to ns(i) do // j=l,2,. . . ,ns(i) 
MaximumMatchingGk (s(i,j)) 
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if (not matched(s(i,j)) then 
if (last_son = 0) then last_son = s(i,j) else 
add edge (lastson, s(i,j)) to the matching 
matched(lasLson) = matched(s(i,j)) = true; lastson = 
if (last_son > 0) then 
add edge (i, lastson) to the matching 
■matched(i) = matchedflastson) = true 

7 First Fit Online Tree Coloring 

A very intuitive algorithm for coloring a graph with n vertices is the first-fit on- 
line coloring heuristic. We traverse the vertices in some order v{l),v{2), . . . , v{n). 
We assign color 1 to v{l) and for « = 2, . . . , n, wc assign to v{i) the minimum 
color c{i) > 1 which was not assigned to any of its neighbours v{j) {j < i). 

A tree is 2- colorable: we root the tree at any vertex r and then compute for 
each vertex i its level in the tree (distance from the root) ; wc assign the color 1 
to the vertices on even levels and the color 2 to those on odd levels. However, 
in some situations, we might be forced to process the vertices in a given order. 
In this case, it would be useful to compute the worst-case coloring that can be 
obtained by this heuristic, i.e. the largest number of colors that are used, under 
the worst-case ordering of the tree vertices {Grundy number). I will present an 
0{n ■ log{log{n))) algorithm for this problem, similar in nature to the linear 
algorithm presented in [4]. For each vertex i, we will compute cmax{i)=the 
largest color the can be assigned to vertex i in the worst-case, if vertex i is the 
last vertex to be colored. The value max{cmax(i)\l < i < n} is the largest 
number of colors that can be assigned by the first fit online coloring heuristic. 

We will root the tree at an arbitrary vertex r. The algorithm consists of 
two stages. In the first stage, the tree is traversed bottom-up and for each 
vertex i we compute c(l,i)=the largest color that can be assigned to vertex 
i, considering only the tree T{i). For a leaf vertex i, we have c{l,i) = 1. 
For a non-leaf vertex i, we will sort its sons s{i,l), . . . , s{i,ns{i)), such that 
c(l,s(i,l)) < c(l,s(i,2)) < ... < c{l,s{i,ns{i))). We will initialize c{l,i) to 
1 and then consider the sons in the sorted order. When we reach son s{i,j), 
we compare c(l,s(i,j)) with c{l,i). If c(l,s(i,j)) > c(l, «), then we increment 
c(l, i) by 1 (otherwise, c(l, i) stays the same). The justification of this algorithm 
is the following: if a vertex i can be assigned color c{l,i) in some ordering of 
the vertices in T(i), then there exists an ordering in which it can be assigned 
any other color c', such that 1 < c' < c(l,i). Then, when traversing the sons 
and reaching a son s{i,j) with c{l,s{i,j)) > c{l,i), we consider an ordering of 
the vertices in T{s{i, j)), where the color of vertex s{i,j) is c(l,i); thus, we can 
increase the maximum color that can be assigned to vertex i. 

After the bottom-up tree traversal, wc have cmax{r) — c(l,r), but we still 
have to compute the values cmax{i) for the other vertices of the tree. We 
could do that by rooting the tree at every vertex i and running the previously 
described algorithm, but this would take 0(n^ • log{log{n))) time. However, we 
can compute these values faster, by traversing the tree vertices in a top-down 
manner (considering the tree rooted at r). For each vertex i, we will compute 
colmax{parent{i),i)=th.e maximum color that can be assigned to parent{i) if we 
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remove T{i) from the tree and afterwards we consider parent{i) to be the (new) 

root of the tree. We will use the values c(2, i) as temporary storage variables. 
c{2,i) is initialized to c(l,i), for every vertex i. When computing cmax{i), we 
consider that vertex i is the root of the tree. Let's assume that we computed 
the value cmax{i) of a vertex i and now wc want to compute the value cmax{j) 
of a vertex j which is a son of vertex i. We remove j from the list of sons of 
vertex i and add parent{i) to this list {parent{i)=YeTtex i's parent in the tree 
rooted at the initial vertex r) . Wc now need to lift vertex j above vertex i and 
make j the new root of the tree. In order to do this, we will recompute the value 
c(2,z), which is computed similarly to c{l,i), except that we consider the new 
list of sons for vertex i (and their c(2, *) values). Afterwards, we add vertex i 
to the list of sons of vertex j. We will compute the value cmax{j) similarly to 
the value c(l, j), using the values c(2, *) of vertex j's sons (instead of the c(l, *) 
values of the sons). After computing cm,ax(j) we restore the lists of sons of 
vertices i and j to their original states (as if the tree were rooted at the initial 
vertex r). After computing the values cmax{u) of all the descendants u of a 
vertex j, we reset the value c(2, j) to c(l, j). 

Both traversals take 0{n ■ log(n)) time, if we sort the ns(i) sons of every 
vertex i in 0{ns{i) ■ log{ns{i))) time. However, it has been proved in [4] that 
the minimum number of vertices of a tree with the Grundy number q is 2''~^, 
which is the binomial tree B{q — 1). The binomial tree B{0) consists of only 
one vertex. The binomial tree B{k > 1) has a root vertex with k neighbors; the 
i*^ of these neighbors (0 < « < fc — 1) is the root of a B{i) binomial tree. Thus, 
every value c(l, *), c(2, *) and cmax{*) can be represented using O {log (login))) 
bits. We can use radix-sort and obtain an 0{n ■ log{log{n))) time complexity. 
The pseudocode of the functions is given below. The main algorithm consists of 
calling FirstFit-BottomUp(r), initiaUzing the c(2, *) values to the c(l, *) values, 
setting cmax{r) = c(l,r) and then calling FirstFit-TopDown(r) 

Compute(i, idx): 

sort the sons of vertex i, such that c(idx,s(i,l))< . . . <c(idx,s(i,ns(i))) 
c(idx,i)=l 

for j=l to ns(i) do if (c(idx,s(i,j))>c(idx,i)) then c(idx,i)=c(idx,i)+l 
FirstFit-BottomUp(i): 

for j=l to ns(i) do FirstFit-BottomUpi'sCijjj 
Compute (i, 1) 

FirstFit-TopDown(i) : 
if {i 7^ r) then 

remove vertex i from the list of sons of parent (i) 

add parent (parent (i)) to the list of sons of parent (i) (if parent (i) ^ r) 

Compute (parent(i), 2); colmax(parent(i),i)=c(2,parent(i)) 

add parent(i) to the list of sons of vertex i 

Compute fij^j; cmax(i)=c(2,i) 

restore the original lists of sons of the vertices parent(i) and i 
for j=l to ns(i) do FirstFit-TopDownCs(ij^^ 

c(2,i)^c(l,i) 
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8 Other Optimization and Counting Problems 



8.1 Building a (Constrained) Tree with Minimum Height 

In this subsection I consider the fohowing optimization problem: We are given 
a sequence of n leaves and each leaf i {I <i <n) has a height h{i). We want 
to construct a (strict) binary tree with n — 1 internal nodes, such that, in an 
inorder traversal of the tree, we encounter the n leaves in the given order. The 
height of an internal node i is h{i) = l+max{h{leftson{i), h{rightson{i))} (the 
height of the leaves is given) . Wc arc interested in computing a tree whose root 
has minimum height. A straight-forward dynamic programming solution is the 
following: compute Hmin{i,j)=the minimum height of a tree containing the 
leaves i, i + 1, We have: Hmin{i, + mini<k<j-i max{Hmin{i, k), 

Hmin{k + l,i)}- Hmin{l,n) is the answer to our problem. However, the time 
complexity of this algorithm is O(n^), which is unsatisfactory. An optimal, 
linear-time algorithm was given in [14]. The main idea of this algorithm is the 
following. We traverse the leaves from left to right and maintain information 
about the rightmost path of the optimal tree for the first i leaves. Then, we 
can add the (i + 1)** leaf by modifying the rightmost path of the optimal tree 
for the first i leaves. Let's assume that we processed the first i leaves and the 
optimal tree for these leaves contains, on its rightmost path, the vertices v{l), 
v{2), v{nv{i)), in order, from the root to the rightmost leaf (w(l) is the 
root). Let's assume that the heights of the subtrees rooted at these vertices 
are hv{l), hv{nv{i)). It is easy to build this tree for i = 1 and i = 2 
(it is unique). When adding the {i + 1)^* leaf, we traverse the rightmost path 
from nv{i) down to 2. Assume that we are considering the vertex v{j). If 
hv{j — 1) < (2 + max{hv{j),h{i + 1)}), then we disconsider the vertex v{j) 
from the rightmost path and move to the next vertex {v{j — 1)). Let's assume 
that the path now contains the vertices v{l), . . . , v(nv'{i)). We replace vertex 
v{nv'{i)) by a new vertex vnew, whose left son will be v{nv'{i)) (together with 
its subtree) and whose right son will be the {i + 1)** leaf. The height of the new 
vertex will be 1 + •max{hv{nv' (i)) , h(i + 1)}. The rightmost path of the optimal 
tree behaves like a stack and, thus, the overall time complexity is linear. 

I will present a sub-optimal 0{n ■ log{n)) time algorithm which is interest- 
ing on its own. The algorithm is similar to Huffman's algorithm for computing 
optimal prefix-free codes, except that it maintains the order of the leaves. A sug- 
gestion that such an approach might work was given to me by C. Ghcorghc. At 
step i {1 < i < n — 1) oi the algorithm, we will have n~i + l subtrees of the opti- 
mal tree. Each subtree j contains an interval of leaves [leftleaf{j), rightleaf{j)] 
and its height is h{j). We will combine the two adjacent subtrees j and j -I- 1 
whose combined height (l+m,ax{height(suhtree j) .heAght( subtree is min- 

imum among all the 0(n) pairs of adjacent subtrees. At the first step, the n 
subtrees arc represented by the n leaves, whose heights arc given. A straight- 
forward implementation of this idea leads to an O(n^) algorithm. However, the 
processing time can be improved by using two segment trees [15], A and B, with 
n and n — 1 leaves, respectively. Each node q of a segment tree corresponds to an 
interval of leaves [left{q),right{q)] (leaves are numbered starting from 1). Each 
leaf node of the segment tree A can be in the active or inactive state. Each node 
g of A (whether leaf or internal node) maintains a value nactive(q), denoting 
the number of active leaves in its subtree. Initially, each of the n leaves of A 
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is active and the nactive{*) values are initialized appropriately, in a bottom-up 

manner (1, for a leaf node, and nactive{leftson{q)) + nactive{rightson{q)), for 
an internal node q). Segment tree B has n — 1 leaves and each node of B (leaf 
or internal node) stores a value he. If leaf i (1 < i < n — 1) is active in A, then 
hc(leaf i)=l+max{h(i), h(j)}, where j > i is the next active leaf. If leaf i is not 
active in A or is the last active leaf, then he(leaf i)=+oo. The value he of each in- 
ternal node g' of -B is the minimum among all the he values of the leaves in node 
g's subtree, i.e. hcfnode q)=m,in{hc(leftson(q)), hc(rightson(q))}. Moreover, 
each node q of B maintains the number Inum of the leaf in its subtree which 
gives the value he(node q). We have Inumfleaf i)=ia.nd Inumfinternal node q)=if 
(hc(leftson(q)) < hc(rightson(q))) then Inum (lefts on (q)) else lnum(rightson(q)). 

At each step i (1 < i < n— 1), each active leaf is the leftmost leaf of 
a subtree of the optimal tree. After every step, the number of active leaves 
decreases by 1. We can find in 0{log{n)) time the pair of adjacent subtrees 
to combine. The height of the combination of these subtrees is hc(root node of 
B), the leftmost leaf of the first subtree is i=lnum(root node of B) and that of 
the second subtree is j = next_active{i) . We define the function next-active 
by using two other functions: rank{i) and unrank{r). rank{i) returns the 
number of active leaves before leaf i {0 < rank{i) < nactive(root node of A)-l). 
unrank{r) returns the index of the leaf whose rank is r. The two functions are 
inverses of each other: unrank(rank{i)) = i and rank{unrank{r)) = r. We 
have rank(i)=rank'(i, root node of A), unrank(r)=unrank'(r, root node of A) 
and next_active(i)=unrank(rank(i) + 1). 

rank'(i, q): 

ii (q is a leaf node) then 

if (left(q)=right(q)^i) then returnCO^ else returnf'-J^ 
else if (i > right{leftson(q)) ) then 

return ( nactive(leftson( q) ) +rank '( i, rightson(q) ) ) 
else retuvTa.(rank'(i, leftson(q))) 

unrank'(r, q): 

\i (q is a leaf node) then 

if (r > 0) then vetvLvr\.(-l) else r etum (left (q)) 
else if {nactive{leftson{q)) < r) then 

return(unrank '( r-nactive(leftson( q) ), rightson( q))) 
else retuvTa.(unrank'(r, leftson(q))) 

The functions rank, unrank and next-active take 0{log{n)) time each. After 
obtaining the indices of the two active leaves * and j whose corresponding sub- 
trees are united (by adding a new internal node whose left son is the root of i's 
subtree and whose right son is the root of j's subtree), we mark leaf j as inac- 
tive. We do this by traversing the segment tree A from leaf j towards the root 
(from j to parent{j), parent{parent{j)), . . . , root node of A) and decrement by 
1 the nactive values of the visited nodes. Then, we change the h values of leaves 
i and j. We set h(i)=hc(root node of B) and h{j) = +oo. After this, we will 
also change the he values associated to the leaves i and j in the segment tree 
B. The new he value of leaf j will be -|-oo. If i is now the last active leaf, then 
hc(leaf i) becomes +00, too. Otherwise, let j' = next-active{i) , the next active 
leaf after i (at this point, leaf j is not active anymore). We will change he(leaf 
node i) to (1 -|- max{h{i), h{j')}). After changing the he value of a leaf k, we 
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traverse the tree from leaf k towards the root (visiting all of fc's ancestors, in 

order, starting from parent{k) and ending at the root of B). For each ancestor 
node q, we recompute hc(node q) as min{hc{leftson(q)),hc{rightson{q))}. 

8.2 The Number of Trees with a Fixed Number of Leaves 

In order to compute the number of labeled trees with n vertices and exactly p 
leaves, we will compute a table NT{i,j)=the number of trees with i vertices and 
exactly j leaves {I < j < i < n). Obviously, we have NT{1, 1) = iVT(2, 2) = 1 
and NT{i,j) = for « = 1,2 and j ^ i. For i > 2, we have NT{i,i) = and 
for 1 < j < i — 1, we will proceed as follows. The j leaves can be chosen in 
C{i,j) ways {i choose j). After choosing the identifiers of the j leaves, we will 
conceptually remove the leaves from the tree, thus remaining with a tree having 
i— j vertices and any number of leaves k {1 < k < j). Each of the j leaves that 
we conceptually removed is adjacent to one of these k vertices. Furthermore, 
each of these k vertices is adjacent to at least one of the j leaves from the 
larger tree. Thus, we need to compute the number of surjective functions / 
from a domain of size j to a domain of size k. We will denote this value by 
NF{j,k). This is a "classical" problem, but 1 will present a simple solution, 
nevertheless. We have NF{0,0) = 1 and NF{j,k) = 0, if j < k. In order 
to compute the values for A; > 1 and j > k, we will consider every number g 
of values x from the set {l,...,j} for which f[x) = k. Once g is fixed, we 
have C{j,g) ways of choosing the g values from the set {1, . . . , j}. For each 
such possibility we have NF{j — g,k — 1) ways of extending it to a surjective 
function. Thus, NF{j, k) = J^l^i C{j,g) ■ NF{j -g,k-l). We can tabulate 
all the NF{*, *) values in 0{n^) time (after tabulating the combinations C{*, *) 
in O(n^) time, first). With the NF{*,*) values computed, we have NT{i,j) = 
j)-Y^k=i k)-NF(j, k)). We can easily compute each entry NT{i,j) 

in 0{n) time, obtaining an 0{n'^) overall time complexity. The technique of 
performing dynamic programming on successive layers of leaves of a tree is also 
useful in several other counting problems. 

8.3 The Number of Trees with Degree Constraints 

We want to compute the number of unlabeled, rooted trees with n > 2 vertices, 
such that the (degree / number of sons) of each vertex belongs to a set S, which 
is a subset of {0, 1, 2, . . . , n — 1}. By (a/b) we mean that a refers to the degree- 
constrained problem and b refers to the number-of-sons-constrained problem 
(everything else being the same). Because every tree with n > 2 vertices must 
contain at least a leaf (a vertex of degree 1) and at least one vertex with at least 
1 son, the set S will always contain the subset ({1}/{0,1}). We will compute 
a table NT{i, j,p)=the number of trees with i vertices, such that the root has 
degree j {j sons) and the maximum number of vertices in the subtree of any son 
of the root is p; moreover, except perhaps the tree root, the (degrees/numbers 
of sons) of all the other vertices belong to the set S. Because the trees are 
unlabeled, we can sort the sons of each vertex in non-decreasing order of the 
numbers of vertices in their subtrees. Thus, we will compute the table NT in 
increasing order of p. A^T(l,0,p) = 1 andiVT(l,j > 0,p) = NT{i > 2, j,0) = 0. 
For p > 1 and i > 2, we have: 
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NT{i, J, p)=NT{i,j, p - 1)+e[Ji ^ NT{i -k-p,j -k,p-l)- CR{TT{p) , k) 
TT{p) is the total miinbcr of trees with p vertices, for which the (degree 
/ number of sons) of the root is equal to some {{x — l)/(x)), x £ S, and the 
(degrees / numbers of sons) of the other vertices belong to the set S. By CR{i,j) 
we denote combinations with repetitions of i elements, out of which we choose 
j. Because the argument i can be very large, we cannot tabulate CR{i,j). 
Instead, we will compute it on the fly. We know that CR{i,j) — C{i + j — 
and that C{i,j) = M±i • C(i,i - 1). Thus, CR{i,j) can be computed in 0{j) 
time. Before computing any value NT{*,*,p), we need to compute and store 
theyaluesTTip), TT{p) = J:^^s NT{p,i{x-l)/ix)),p-l), 'diid CR{TT{p),k), 

for all the values of A; (1 < fc < ). We can compute all of these values 

in 0{n^ ■ log{n)) time. The desired number of trees is X^3;g5 -'VT(n, a;, n — 1). 
The memory storage can be reduced from O(n^) to O(n^), by noticing that the 
values NT{*, *,p) are computed based only on the values NT{*, *,p— 1). Thus, 
we can maintain these values only for the most recent two values of p. 

A less efficient method is to compute the numbers Tok{i)=tlic number of 
trees with i vertices, such that each vertex satisfies the (degree/number of sons) 
constraints. Tok{l) = Tok{2) = 1. We will make use of the TT{i) values 
defined previously, except that they will be computed differently. For every 
i > 2, we consider every possible number x of sons of the tree root and compute 
NT2(i, x)=the number of trees with i vertices, such that the tree root has x sons 
and all the other vertices satisfy the (degree/number of sons) constraints. We 

will generate all the possibilities (j/(l), 7/(2), y(i — 1)), with < y(j) < ^y!-J 

(1 < j < i — and y(l) + . . . + y{i — l)=x. y{j) is the number of sons of the tree 
root which have j vertices in their subtrees. The number of trees "matching" 
such a partition is equal to Ylj^i CR(TT{j),y{j)). NT2{i,x) is computed by 
summing the numbers of trees "matching" every partition. Afterwards, ifx €: S, 
we add NT2{i,x) to Tok{i). U x = {{y - l)/{y)) and y G S, then we add 
NT2{i,x) to TT(i). NT2{i,x) may be added to both Tok(i) and TT{i). 



9 Related Work 

Reliability analysis and improvement techniques for distributed systems were 
considered in [6,7]. Reliability analysis and optimization for tree networks in 
particular were considered in [3,5,8]. Diff'erent kinds of tree partitioning algo- 
rithms, based on optimizing several objectives, were proposed in [9,10,16]. Prob- 
lems related to tree coloring were studied in [4] . Content delivery in distributed 
systems is a subject of high practical and theoretical interest and is studied 
from multiple perspectives. Communication scheduling in tree networks was 
considered in many papers (e.g. [17]) and the optimization of content delivery 
trees (multicast trees) was studied in [11]. 



10 Conclusions and Future Work 

In this paper I considered several optimization problems regarding distributed 
systems with tree topologies (e.g. peer-to-peer networks, wireless networks, 
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Grids), which have many practical apphcations: minimuin weight cycle com- 
pletion (reliability improvement), constrained partitioning (distributed coordi- 
nation and control), minimum number of streams and degree-constrained mini- 
mum spanning trees (efficient content delivery), optimal matchings (data repli- 
cation and resource allocation), coloring (resource management and frequency 
allocation) and tree counting aspects. All these problems are variations or exten- 
sions of problems which have been previously posed in other research papers. 
The presented techniques are either better (faster or more general) than the 
previous solutions or easier to implement. 
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